seq list 



SEQUENCE LISTING 

<110> Ago, Hideo 

Mi.yano, Masashi 
Adachi , Tsuyoshi 

<12 0> HCV Polymerase Suitatle for Crystal 

Structure Analysis and Method for Using the Enzyme 



<13 0> SHIM007 

<14 0> 0 9/608, 713 
< 1 4 1 > 2000-00-30 

<150> 11-188030 
<151> 1999-07-02 

<150> 11-192438 
<151> 1999-07-07 

<160> 13 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<21I> 591 
<212> PRT 

< 2 1 3 > Hepatitis C Virus 



<4 0 0> 1 



Ser 


Met 


Ser 


Tyr 


Thr 


Trp 


Thr 


Gly 


Ala 


Leu 


lie 


Thr 


Pro' 


Cys 


Ala 


Ala 


1 








5 










10 










15 




Glu 


Glu 


Ser 


Lys 

2 0 


Leu 


Pro 


He 


Asn 


Ala 
25 


Leu 


Ser 


Asn 


Ser 


Leu 
3 0 


Leu 


Arg 


His 


Hie 


Asn 

35 


Met 


Val 


Tyr 


Ala 


Thr 

40 


Thr 


Ser 


Arg 


Ser 


Ala 
4 5 


Gly 


Leu 


Arg 


Gin 


Lys 
50 


Lys 


Val 


Thr 


Phe 


Asp 

55 


Arg 


Leu 


Gin 


Val 


Leu 
6 0 


Asp 


Asp 


His 


Tyr 


Arg 


Asp 


Val 


Leu 


Lys 


Glu 


Met 


Lys 


Ala 


Lys 


Ala 


Ser 


Thr 


Val 


Lys 


Ala 


65 










7 0 










7 5 










3 0 


Lys 


Leu 


Leu 


Ser 


Val 
85 


Glu 


Glu 


Ala 


Cys 


Lys 
9 0 


Leu 


Thr 


Pre 


Pro 


His 
95 


Ser 


Ala 


Lys 


Ser 


Lys 

100 


Phe 


Gly 


Tyr 


Gly 


Ala 

105 


Lys 


Asp 


Val 


Arg 


Asn 

110 


Leu 


Ser 


Ser 


Lys 


Ala 
115 


Val 


Asn 


His 


He 


His 
120 


Ser 


Val 


Trp 


Lys 


Asp 
125 


Leu 


Leu 


Glu 
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seq list 

Asp Thr Val Thr Pro He Asp Thr Thr He Met Ala Lys Asn Glu Val 

13 0 13 5 1 -1 0 

Phe Cys Val Gin Pr: Glu Lys Gly Gly Arc Lys Pro Ala Arg Leu He 
145 150 155 160 

Val Phe Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr 

* 165 " 17 j ' 175 

Asp Val Val Ser Thr Leu Pre Gin Val Val Met Gly Ser 3er Tyr Gly 

100 135 190 

Phe Gin Tyr Ser Pr: Gly GH Arg Val Glu Phe Leu Val Asn Thr Trp 

19 5 3 0 0 5 0 5 

Lys Ser Lys Lys Asn Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe 

3 10 315 23 0 

Asp Ser Thr' Val Thr Glu Asn Asp He Arg Val Glu Glu Ser lie Tyr 
225 230 " 235 240 

Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Lys Ser Leu 

2-; 5 350 5 55 

Thr Glu Arg Leu Tyr lie Gly Gly Pro Leu Thr Asn Ser Lys Gly Gin 

2 6 0 2 6 5 2 7 0 

Asn Cys Gly Tyr Arg Ara Cys Arg Ala Ser Gly Val Leu Thr Thr Ser 

275 " 280 235 

Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Cys Arg 

2 90 2 95 3 00 

Ala Ala Lys Leu Gin Asp Cys Thr Met: Leu Val Asn Gly Asp Asp Leu 
305 310 315 320 

Val Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Ala Ala Ser Leu 

32 5 3 3 0 3 3 5 

Arg Val Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp 

3 4 0 3 4 5 3 5 0 

Pro Pro Gin Pro Glu Tyr Asp Leu Glu Leu lie Thr Ser Cys Ser Ser 

3 5 5 3 6 0 3 6 5 

Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr Tyr Leu 

37 0 37 5 38 0 

Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Ala 
385 " ' 390 " 395 400 

Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn He He Met Tyr Ala 

4 05 410 415 

Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe Ser He 

420 42 5 430 

Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Gin Lie Tyr 

43 5 4 40 '4 45 

Gly Ala Cys Tyr Ser He Glu Pro Leu Asp 2eu Pro Gin He He Glu 

4 50 4 5 5 4 6 0 

Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro Gly 
465 470 475 430 

Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val Pro Pro 

48 5 4 90 4 95 

Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Arg Leu Leu 
5 0 0 5 0 5 5 1 0 

Eaae 2 



s e q list 

Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe Asn Trp 

515 " 52 0 52 5 

Ala Val Lys Thr Lys Leu Lys Leu Thr Pro He Pro Ala Ala Ser Gin 

53 0 535 540 

Leu Asr Leu Ser Gly Trp Phe Val Ala Gly Tyr Ser Gly Gly Asp He 
54 5 * * 550 555 5 60 

Tyr His Ser Lou Ser Arg Ala Arg Pro Arg Trp Phe Mel Leu Cys Leu 

065 i7g 575 

Leu Leu Leu Ser Val Gly Val Gly lie Tyr Leu Leu Pro Asn Arg 
5 >'•■"] ^83 0 90 



< 0 1 0 > 2 
<211:> 2 889 
<212^> I'.'MA 

<2 1 3 > Art i fie : al Sequence 

< 2 2 0 > 
<221> CDS 

<222> ( 1 ) . . . ( 1743) 

<223> E'lJA encodirig fusion protein consist in of a portion 
of HCV polymerase and histidine tag at the 
G-termmus 

<2 2 1 ■> mi s c_f eature 

<222> 144,~132, 2 14, 294, 312, 560, 622, 71 6, 776, 939, 1007, 1017, 
1031, 1095, i:83, 1149, 1270, 1349, 1405, 1408, 1501, 1588, 
1604, 1679, 1824, 1899, 2076, 2086, 2221, 2225, 2240, 2313, 
244 5, 2 605, 2oj4, 2760 
<223> n = A, T, G or G 

<22 1 ^> tt l .i_ s c f ea tu re 

<222> 144,~152, 1679, 1824, 1899, 2076, 2086, 2221, 2225, 2240, 
2313, 2445, 26015, 2634 , 2760 
<223 > n = A, T, C ir G 

<223> LNA encoding fusion protein consisting of a 

portion of HCV polymerase and histidine tag at the 
C- terminus 

<22 1 > mi. sc_f eature 

<222:> 144,~152, 214, 294, 312, 560, 622, 716, 77-, 939, 1007, 1017, 
1031, 1 095, llc3, 1 249, 1 270, 1.34 9, 1405, 1408, 1501, 1538, 
1604, 1679, 1-24, 1899, 2076, 2086, 2221, 2225, 2240, 2313, 
2445, 2605, 2634, 2760 
<223> n - A, T, G or G 

<221> misc feature 
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seq list 

<222> 144, 152, 214, 294, 312, 5 60, 62 2, 716, 776, 939, 1007, 1017, 
1031, 1095, 1183, 1249, 1270, 134 9, '.405, 1403, 1 501 , 1538, 
1604 , 1 679, 1.824, 1399, 2076, 2036, 2121, 222 5, 2240, 2313, 
2445, 2 6 05, 2634, 2760 
<223> r. = A, T, C or G 

< 4 0 0 > 2 

atg tea atg toe tac aca tgg aca ggc gee ttg ale acg cca tgc get 48 

Met Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu lie Thr Pro Cys Ala 

1 5 10 15 

mts rmt srt yrt hrt rth rgy aau thr rcy saa gog gag gaa age aag 96 

Xaa Xaa Xaa Ma a Xaa Xaa Xaa Asn Xaa Xaa Xaa Ala Glu Glu Ser Lys 

1: 0 ; 5 3 0 

ctg eee ate aae gcg teg age aac tet ttg etg aag ugu sry sur asn 144 

Leu Pro lie Asn Ala Leu Ser Asn Ser Leu Leu Lys Cys Xaa Xaa Xaa 
3 5 4 0 4 5 

aau sra sns ruu ege eae eat aae atg gtt tat gee aea aea tet ege 192 

Asn Xaa Xaa Xaa Arg ills His Asn Met Val Tyr Ala thr Thr Ser Arg 
5 0 5 5 60 

age gea ggc etg arg hsh sas nmt vat yra ath rth rsr arg sra agy 240 

Ser Ala Gly Leu Xaa Xaa Xaa Xaa Xaa Xaa Lie Xaa Xaa Xaa Xaa Xaa 

65 70 75 30 

ucg gea gaa gaa ggt eae etc tga eag aet gea agt eet gga ega eea 288 

Ser Ala Glu Glu Gly His Leu * Gin Thr Ala Ser Pro Gly Arg Pro 

3 5 90 9 5 

ear ggn ysy sva thr has arg ugn vau asa shs tac egg gae gtg etc 336 

Gin Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Arg Asp Val Leu 

100 lu5 " 11.0 

aag gag atg aag gcg aag gcg tec aca gtt aag tyr arg asv auy sgu 384 

Lys Glu Met Lys Ala Lys Ala Ser Thr Val Lys Xaa Xaa Xaa Xaa Xaa 
115 110 12 5 

mty saa ysa asr thr vay sge taa aet eet ate egt aga gga age ctg 432 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa * Thr Pro lie Arg Arg Gly Ser Leu 
130 135 140 

caa get gae gsc ccc aca taa ysu usr vag ugu aac ysy sut hrr rhs 480 

Gin Ala Asp Ala Pro Thr * Xaa Xaa Xaa Cys Asn Xaa Xaa Xaa Xaa 
145 150 155 

teg gee aaa tee aag ttt ggc tat ggg gea aag gae gte egg aae eta 528 
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Ser Ala Lys 

16 0 

sra ays sry 
Xaa Xaa Xaa 

1.7 5 

aac cac ate 
Asn His He 
190 

hss rva try 
Xaa Xaa Xaa 



tgg eaa aaa 
Trp Gin Lys 



ttt etg tgt 
Phe Leu Cys 



hey sva gnr 
Xaa Xaa Xaa 



gag tec gtg 
Glu Ser Val 



guy smt aau 
Xaa Xaa Asn 

135 

tec tea Lac 
Ser Ser Tyr 

30 0 

eag tac tet 
Gin Tyr Ser 
315 

yrs rrg ygn 
Xaa Xaa Xaa 



tgg get ttt 



Ser Lys Phe 



shg yty rgy 
Xaa Xaa Xaa 



cac tec gtg 
His S e r V a 1 
: 95 

sas uug aag 
Xaa Leu Lys 

: ": 1 0 

atg agg uas 
Met Arg Xaa 

2 2 5 

cca acc aga 
Pro Thr Arg 

2 4 0 

guy sgy gya 
Xaa Xaa Xaa 
2 55 

tat gcg aga 
Tyr Ala Arg 
270 

tat gat gtg 
Tyr Asp Val 

tyr asv ava 
Xaa Xaa Xaa 



cct qgg cag 
Pro Gly Gin 
32 0 

arg vag uhu 
Xaa Xaa Xaa 

33 5 

cat atg aca 



soq 
Gly Tyr Gly 
165 

a ay sas vaa 
Asn Xaa Xaa 

8 o 

tgg aag gac 
Trp Lys Asp 



aca etg tga 
Thr Leu * 



thr vat hrr 
Xaa Xaa Xaa 



gaa agg agg 
Glu Arg Arg 



rgy sra aar 
Xaa Xaa Lys 



aga tgg cce 
Arg Trp Pre 
275 

gtc tec acc 
Val Ser Thr 
2 90 

srt hru rgri 
Xaa Xaa Xaa 

305 

ega gtc gag 
Arg Val Glu 



vaa snt hrt 
Xaa Xaa Xaa 



etc get gtt 



ist 

Ala Lys Asp 



rga snu tec 
X a a Xaa Ser 

185 

1 1 g etg srs 
Leu Leu Xaa 

2 0 0 

cac caa ttg 
His Gin Leu 
215 

ast hrt hrm 
Xaa Xaa Xaa 
2 30 

ccg taa gee 
Pre * Ala 
2 4 5 

qua teg tat 
Val Ser Tyr 

2 6 0 

tcv ahr asu 
Xaa Xaa Xaa 



ett cct cag 
Leu Pro Gin 



vav acit gys 
Xaa X a a Xaa 
3 1 0 

ttc ::tg gtg 
Phe Leu Val 
325 

gga aat caa 
Gly Asn Gin 
3 4 0 

rys sry sys 



Val Arg Asn 

17 0 

age aag gee 
Ser Lys Ala 



rys aav aas 
Xaa Xaa Xaa 



aca cca cca 
Thr Pro Pro 



taa ysa sng 
* Xaa X a a 



age ccg cct 
Ser Pro Pro 



tec cag ate 
Ser Gin lie 
265 

gyv aar gva 
Xaa Lys Xaa 

2 3 0 

gtc gtg atg 
Val Val Met 
295 

rsr tyr gga 
Xaa Xaa Gly 



aat acc gyh 
Asn Thr Xaa 



aga aaa acc 
Arg Lys Thr 
34 5 

asn rmt gyh 



Leu 



gtt 57 6 
Val 



nhs 62 4 
X a a 

2 0 5 

tea 672 

Ser 

220 

ugt 72 0 

Xaa 

2 35 

tva 7 63 
Xaa 

2 50 

tgg 816 
Trp 

cys 3 64 
Xaa 



ggc 912 
Gly 



ttc 960 
Phe 



gnt 10 0 8 

Xaa 

3 30 

cca 10 56 
Pro 



srt 1104 
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Trp Ala Phe 

yra sth rar 
Xaa Xaa Xaa 

3 65 

tga qga gec 
* (.My Val 



aat gtc gtg 
Asn Val Val 
395 

ncy scy sas 
Xaa Xaa Xaa 
410 

egg ggg tec 
Arg Gly Ser 
42 5 

hra sns rys 
Xaa Xaa Xaa 
440 

tgc tga cga 
Cys * Arg 
455 

thr age tgc 
Xaa Ser Cys 
470 

tgt sre ysg 
Cys Xaa Xaa 



gaa get :ca 
Glu Ala Pre 



sug :ias cys 
Xaa Xaa Xaa 

52 0 

egg gaa ccc 



His Met Thr 
3 50 

gey stt cga 

Xaa Xaa Arg 



aat tha ssr 
Asn Xaa Xaa 



act z g g c c c 
Thr Trp Pro 



uaa rgu aaa 
* Xaa Lys 



tct gac taa 
Ser Asp * 



gyc aga act 
Xaa Arg Thr 



ctg nas ncy 
Leu Xaa Xaa 



ggt aac aec 
Gly Asn Thr 
4 7 5 

yas nth rut 
Xaa Xaa Xaa 

4 90 

gga ctg :a-: 
Gly Leu His 
50 5 

thr ntu vaa 
Xaa Xaa Xaa 



aag agg azg 



seq 
Leu Ala Val 

0 z ) c 

etc aac ggt 
Leu Asn Gly 

37 0 

thr vat hrg 

Add Add Add 

?. y 5 

ccg aag cca 
Pro Lys Pro 
4 0 0 

rgg riaa yss 

Xaa Add Add 

415 

ttc aaa agg 
Phe Lys Arg 
4 30 

gcg gtt ate 
Ala Vdl He 
4 4 5 

sgy tyr arg 
Xaa Ada Xaa 
4 6 0 

etc aca tgt 
Leu Thr Cys 



hrc yst yru 

Add Add Add 



gat get cgt 
Asp Ala Arg 

blh 

sng yas ase 
Xaa Xaa Xaa 

52 5 

egg cga gcu 



1st 

X a a X a a X a a 

cac cga gaa 
His Arg Glu 



uas nas arg 
Xaa Xaa Xaa 



gac agg cca 
Asp Arg Pro 
•l 0 5 

ret cac aga 
Xaa His Arg 



gut hrg uar 
Arg Xaa Ma a 



gee ggt gee 
Ala GLy Ala 

4 5 0 

arg cys arg 
Xaa Xaa Xaa 
4 6 5 

t a c 1 1 g a a g 
Tyr Leu Lys 
4 3 0 

ysa asr aaa 
Xaa Xaa Lys 
4 95 

gaa egg aga 
Glu Arg Arg 



teg teg tta 
Ser Ser Leu 



vav acy sgu 



Xaa Xaa Xaa 

360 

cga cat ccg 
Arg His Pro 
.J 7 5 

vag ugu srt 
Xaa Cys Xaa 

3 9 0 

taa aat cgt 
+ Asn Arg 



gcg get tta 
Ala Ala Leu 
4 2 0 

gut yrg ygy 
X a a Xaa X a a 
4 35 

gcg cga gcg 
Ala Arg Ala 



aas rgy vau 

Add Xdd Xdd 



gec tct gcd 
Ala Ser Ala 



acy scg age 
Xaa Xdd Ser 

500 

cga car gaa 
A r j Gin Glu 
515 

tct gtg aaa 
Ser Val Lys 
530 

sra agy thr 



Xaa 



tgt 1152 
Cys 



ace 12 On 
Thr 



yrg 124 3 
Xaa 



tat 1296 
Tyr 



rut 134 4 
Xaa 



gcg 13 92 
Ala 



thr 1440 
Xaa 



gec 148 3 

Ala 

4 35 

tgc 153o 
Cys 



a a y 15 8 4 
Asn 



gcg 1632 
Ala 



gng 168 0 
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Arg Glu Pro Lys Arg Thr Arg Arg Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
535 540 545 



uas aaa asr eta cga gtc ttc acg gag get atg act agg :ac tec gee 
Xaa Lys Xaa Leu Arg Va I Phe Thr GLu Ala Met Thr Arg Cyr Ser Ala 

5 5 0 5 5 5 5 6 0 5 6 5 



172! 



ecc ggg ear gva nthrguaamt thrargtyrs 
Pre Gly Xaa Xaa 
57 0 



~aa rrgyga z eegeeeeaae 



17 0 3 



cagaat acga 
ssrtccaatg 
vaaahsasaa 
gctgcgtggg 
tccagt taac 
asnmtt yrge 
hrutraaarg 
gaetgeeaga 
gecac: tgac 
atgge ct tag 
rtyrsrrggt 
snargvaaas 
tccgcgctag 
ggggagggee 
yystyruhas 
raavaysthr 
tacagegggg 
tetegt gece 
shshscaeca 



ct tggagctg 
tgteggtege 
srgyysargv 
agacauthra 
tecegge tag 
gceeact t tg 
mtumtthrhs 
tcuuaagngu 
ctacctcaga 
c g e a 1 1 1 1 e a 
gagatcaa ta 
reysuargys 
getaruargv 
geeaettgt g 
ntgggcagtg 
ysuysuthr r 
gagaegnuas 
gaccccgcgg 
t cactaataa 



ataaeatcat 
e eacgatgea 
a tyrtyrctc 
rgasrchr th 
geaacat tat 
tgggcaagga 
hhs ra t cct t 
gnuguysaau 
tcatttyrgy 
etccatagtt 
gggtggctte 
ugyvarcee t 
a trarghsar 
geaagtaect 
aagaccaaac 
raaaasreag 
asrgytrhva 
accceatcac 
hshshs 



gttccas rrg 
t eaggeaaaa 
ace eg t gat-:: 
r ruaaargaa 
tatgtataaa 
tgattetga t 
e tagegeagg 
ascysgntae 
aaeys ty rs r 
ac tcteeagu 
atgeeteagg 
tgegagtetg 
gaaargsrva 
et teaaeus r 
t eaaae t eac 
etggaet tgt 
aagytyrs rg 
cattyrhs sr 



rirgutyrasa 
gggtgtaeta 
e eaeeaeeee 
aatrgutnrg 
rghs thr rva 
gaetcae t. tc 
ageaac t tga 
ggggeetgt t 
guruasurgn 
arguhsgyus 
aaaettgggg 
gagaca tegg 
argaaargue 
gngygyarga 
teeaateeeg 
eeggetggtt 
ygyasa t a t a 
usrargaaar 



guuthrsrey 1343 
esrasnvasr 19 03 
retegcaegg 19*53 
etagacaeae 202 3 
asnsrtrugy 2 0 0 3 
ttcteeaart 2143 
aaaagecctg 22 0 3 
aetccat tga 22 63 
gaaegactee 2 32 3 
raahsruhss 2303 
taccagygua 24 4 3 
gceaggageg 2 503 
tgtcceaggg 2563 
aaathrcysg 2623 
ge tgegtee t 2 60 3 
egttgctggt 2743 
teaeagcetg 2 8 0 3 
grarggysrh 28 63 
2889 



<210> 3 
<211> 5 70 
<212> PRT 

<213> Artificial Sequence 



<220> 

<221> VARIANT 

<222> 17, 10, 19, 20, 2 1, 22, 2 3, 25, 26, 27, 46, 47, 43, 50, 51, 
52, 69, 70, 71, 72, 73, 74, 76, 77, 73, 79, 80, 93, 99, 
100, 101, 102, 103, 104, 105, 106, 123, 124, 125, 126, 127, 
128, 129, 130, 131 , 132, 133, 134, 149, 1 5 0 , 151, 154 
<223> Xaa = Any Amino Acid 

<22 1> VARIANT 

<222> 155, li.6, 1 57, 1 74 , 1 75, 1 76, 177, 1 78, 179, 1 31 , 1 32, 183, 
184, 201, 202, 203, 204, 205, 20 6, 207, 208, 209, 226, 227, 
228, 229, 23U, 231, 232, 233, 234, 235, 250, 251, 252, 253, 
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s q list 

254, 255, 256, 257, 25?, 276, 277, 278, 279, 281, 282 
<223> Xaa - Any Amino Acid 

<221> VARIANT 

<222> 2-5, 284, 5 02, 3u3, 304, 305, 300, 307, 308, 309, 310, 311, 

312, 31 S 35..), 331, 332, 333, 334, 335, 336, 337, 338, 339, 

35o, 35'', 358 , 359, 360, 361, 362, 363, 364, 365, 365, 367, 

38 3, 5 ~ 3 , 3 84, 3 8 5, 38 6, 3 87, 3 8 - , 3 - 9, 3 90, 3 92, 4 <■■::■ 
<2 2 3> Xaa = Any Amino Acid 



<2 2i> variant 

<222> 409, 410, 411, 4:2, 414, 415, 416, 417, 433, 4 34, 435, 436, 
437, 4 38, 439, 440, 441 , 442, 458, 459, 4 60, 4 61 , 4 62, 4 63, 
464, 4 63, 466, 467, 4 66, 4 69, 47'), 487, 488, 489, 490, 4^1, 
492, 4 93, 494, 495, 496, 4 98, 499, 516, 51 9, 52 0, 521 

<2 2 3> Ma a = Any Amino Arid 



<221> VARIANT 

<222> 522, 523, 524, 525, 526, 543, 544, 545, 546, 547, 548, 549, 

550, 552, 569, 570 

<223> Xaa = Any Amino Acid 

<223> E'NA encoding fusion protein consisting of a 

portion of HCV polymerase and histidine tag at the 
C- terminus 

<400> 3 

Met Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu lie Thr Pro Cys Ala 

1 5 10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Xaa Xaa Xaa Ala Glu Glu Ser Lys 



-I 



( ) 



Leu Pro lie Asn Ala Leu Ser Asn Ser Leu Leu Lys Cys Xaa Xaa Xaa 

35 4 0 " 4 5 

Asn Xaa Xaa Xaa Arg His His Asn Met Val Tyr Ala Thr Thr Ser Arg 

5 0 5 5 6 0 

Ser Ala Gly Leu Xaa Xaa Xaa Xaa Xaa Xaa Lie Xaa Xaa Xaa Xaa Xaa 

65 7 0 7 5 8 0 

Ser Ala Glu Glu Gly His Leu Gin Thr Ala Ser Pro Gly Arg Pro Gin 

6 5 9 0 95 

Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Arg Asp Val Leu Lys 

1 0 0 1 0 5 1 1 0 

Glu Met Lys Ala Lys Ala Ser Thr Val Lys Xaa Xaa Xaa Xaa Xaa Xaa 

115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Thr Pro lie Arg Arg Gly Ser Leu Gin Ala 

130 135 140 

Asp Ala Pro Thr Xaa Xaa Xaa Cys Asn Xaa Xaa Xaa Xaa Ser Ala Lys 

145 150 155 160 

Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu Xaa Xaa Xaa 
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Xaa Xaa Xaa Asn 

180 

His Ser Val Trp 
195 

Xaa leu Lys Thr 
210 

Arg Xaa Xaa Xaa 

22 5 

Arg Glu Arg Arg 

Xaa Xaa Lys Val 
2 60 

Arg Trp Pro Xaa 
275 

Val Ser Thr Leu 

290 

Xaa Xaa Xaa Xaa 
305 

Arg Val Glu Phe 

Xaa Xaa Xaa Gly 
34 0 

Leu Ala Val Xaa 
355 

Leu Asn Gly His 
370 

Xaa Xaa Xaa Xaa 
385 

Lys Pro Asp Arg 

Xaa His Arg Ala 
42 0 

Xaa Xaa Xaa Xaa 

435 

Gly Ala Ala Arg 
450 

Xaa Xaa Xaa Xaa 
465 

Lys Ala Ser Ala 

Lys Xaa Xaa Ser 

50 0 

Arg Arg Gin Glu 
515 

Leu Ser Val Lys 
530 

Xaa Xaa Xaa Xaa 



165 

Xaa Xaa Xaa Xaa 

Lys Asp Leu Leu 

2 00 

Leu His Gin Leu 
215 

Xaa Xaa Xaa Xaa 

230 

Pro Ala Ser Pro 
245 

Ser Tyr Ser Gin 

Xaa Xaa Xaa Lys 

2 8 0 

Pro Gin Val Val 
2 95 

Xaa Xaa Xaa Xaa 
310 

Leu Val Asn Thr 
325 

Asn Gin Arg Lys 

Xaa Xaa Xaa Xaa 

360 

Arg Glu Arg His 
375 

Xaa Xaa Gys Xaa 
390 

Pro Asn Arg Xaa 
405 

Ala Leu Tyr Arg 

Xaa Xaa Xaa Xaa 
44 0 

Ala Ala Gys Arg 
455 

Xaa Xaa Ser Gys 
470 

Ala Gys Xaa Xaa 
485 

Gys Glu Ala Pro 

Asn Xaa Xaa Xaa 

52 0 

Ala Arg Glu Pro 
535 

Xaa Xaa Lys Xaa 



e q list 
170 

Ser Ser Lys Ala 
185 

Xaa Xaa Xaa Xaa 

Thr Pre Pro Ser 
220 

Xaa Xaa Xaa Phe 
235 

Pro Xaa Xaa Xaa 

2 50 

lie Trp Glu Ser 
2 65 

Xaa Xaa Xaa X a a 

Met Gly Ser Ser 

3 00 

Gly Phe Gin Tyr 
315 

Xaa Xaa Xaa Xaa 

330 

Thr Pro Trp Ala 
345 

Xaa Xaa Xaa Xaa 

Pro Gys Gly Val 
330 

Thr Asn Val Val 
395 

Xaa Xaa Xaa Xaa 

410 

Gly Ser Ser Asp 
425 

Xaa Xaa Arg Thr 

Leu Xaa Xaa Xaa 
460 

Gly Asn Thr Leu 
475 

Xaa Xaa Xaa Xaa 

4 90 

Gly Leu His Asp 

b0 5 

Xaa Xaa Xaa Xaa 

Lys Arg Thr Arg 
540 

Leu Arg Val Phe 
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17 5 

Val Asn His He 
190 

a a a /.a a /.a a a a a 

2 0 5 

Trp Gin Lys Met 

Leu Gys Pro Thr 
24 0 

X a a X a a Xaa X a a 

2 55 

Val Tyr Ala Arg 

2 7 0 

Asn Tyr Asp Val 
2 8 5 

Tyr Xaa Xaa Xaa 

Ser Pro Gly Gin 
32 0 

Xaa Xaa Xaa X a a 

335 

Phe His Met Thr 

3 50 

Xaa Xaa Xaa Arg 
365 

Asn Xaa Xaa Xaa 

Thr Trp Pro Pro 
400 

Lys Xaa Xaa Xaa 
415 

Phe Lys Arg Arg 
430 

Ala Val lie Ala 
44 5 

Xaa Xaa Xaa Xaa 

Thr Gys Tyr Leu 
430 

Xaa Xaa Xaa Xaa 
4 95 

Ala Arg Glu Arg 
510 

Xaa Xaa Ser Ser 

525 

Arg Ala Xaa Xaa 
Thr Glu Ala Met 



s e q list 

545 550 555 560 

Thr Arg Tyr Ser Ala Pro Pro Gly Ma a Xaa 

5 0 5 o '"/ 0 



<210> 4 
<21l:> 579 
<212> ?RT 

<213'-' Artificial Sequence 
<2 2 0 > 

<223> DNA encoding fusion protein consisting of a 

portion of HCV polymerase and histidine tag at the 
C- terminus 



<4 00> 4 



Met 


Ser 


Met 


Ser 


Tyr 

c 


Thr 


Trp 


Thr 


Gly 


Ala 

1 0 


Leu 


lie 


Thr 


Pro 


Gys 
1 5 


Ala 


Ala 


Glu 


Glu 


Ser 
2 0 


Lys 


Leu 


Pro 


lie 


Asn 

25 


Ala 


Leu 


Ser 


Asn 


Ser 
3 0 


Leu 


Leu 


Arg 


His 


His 
3 5 


Asn 


Met 


Val 


Tyr 


Ala 


Thr 


Thr 


Ser 


Arg 


Ser 
4 5 


Ala 


Gly 


Leu 


Arg 


Gl r 
50 


T , \/ ^ 
u y o 


T , v ^ 
j-j y o 


Val 


Thr 


P he 

5 5 


Asp 


Arg 


Leu 


G 1 n 


Val 


Leu 


Asp 


Asp 


H L s 


Tyr 


Arg 


Asp 


Val 


Leu 


Lys 


Glu 


Met 


Lys 


Ala 


Lys 


Ala 


Ser 


Thr 


Val 


Lys 


0 3 






























j J 


Ala 


Lys 


Leu 


Leu 


Ser 

8 5 


Val 


Glu 


Glu 


Ala 


Gys 
9 0 


Lys 


Leu 


Thr 


Pro 


? ro 
9 5 


H Ls 


Ser 


Ala 


Lys 


Ser 
100 


Lys 


Phe 


Gly 


Tyr 


Gly 
105 


Ala 


Lys 


Asp 


Val 


Arc 
110 


Asn 


Leu 


Ser 


Ser 


Lys 
115 


Ala 


Val 


Asn 


His 


lie 

12C 


His 


Ser 


Val 


Trp 


Lys 
125 


Asp 


Leu 


Leu 


Glu 


Asp 
130 


Thr 


Val 


Thr 


Pro 


lie 
135 


Asp 


Thr 


Thr 


He 


Met 

140 


Ala 


Lys 


Asn 


G Lu 


Val 


Phe 


Gys 


Val 


Gin 


Pro 


Glu 


Lys 


Gly 


Gly 


Arq 


Lys 


P r o 


Ala 


Arg 


Leu 


145 










150 










15 5 










160 


He 


Val 


Phe 


Pro 


Asp 
1 05 


Leu 


Gly 


Val 


Arg 


Val 
170 


Gys 


Glu 


L y s 


Met 


Ala 
175 


Leu 


Tyr 


Asp 


Val 


Val 

13 0 


Ser 


Thr 


Leu 




Gin 
185 


Val 


Val 


Met 


G.l y 


Ser 

190 


Ser 


Tyr 


Gly 


Phe 


G 1 n 
195 


Tyr 


Ser 


Pre. 


G 1 y 


Glr. 

2 0 0 


Arg 


Val 


G 1 u 


Phe 


Leu 
205 


Val 


Asn 


Thr 


Trp 


Lys 

210 


Ser 


Lys 


Lys 


Asn 


Pro 

215 


Met 


Gly 


Phe 


Ser 


Tyr 

2 20 


Asp 


Thr 


Arg 


Gys 


Phe 


Asp 


Ser 


Thr 


Val 


Thr 


Glu 


Asn 


Asp 


Tie 


Arg 


Val 


Glu 


Glu 


Ser 


lie 


225 










230 










2 35 










24 0 


Tyr 


Gin 


Gys 


Gys 


Asp 
2 45 


Leu 


Ala 


Pro 


Glu 


Al a 

2 5 0 


Arg 


Gin 


Ala 


He 


Lys 

2 55 


Ser 
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s e q list 

Leu Thr Glu Arg Leu Tyr lie Gly Gly Pro Leu Thr Asn Ser Lys Gly 

2 6 0 2 0 5 2 7 0 

Gin Asn Gys GLy Tyr Arg Arcr Gys Arg Ala Ser Gly Val Leu Thr Thr 

275 " 2 h j " 285 

Ser Gys Gly Asn Thr Leu Thr Gys Tyr Leu Lys Ala Ser Ala Ala Gys 

29 0 29 5 30 0 

Arg Ala Ala Lys Leu Gin Asp Gys Thr Met Leu Val. Asn Gly Asp Asp 
30 5 ' 310 3 15 32 0 

Leu Val Val lie Gys Glu Ser Ala Gly Thr Gin Glu Asp Ala Ala Ser 

325 330 335 

Leu Arg Val Phe Thr Glu Ala Met Thr Ara Tyr Ser Ala Pro Pro Gly 

340 345 " 350 

Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu Leu lie Thr Ser Gys Ser 

355 300 365 

Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Ara Val Tyr Tyr 

37 0 37 5 3c 0 

Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr 
395 " 390 395 400 

Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn lie Lie Met Tyr 

4 05 410 415 

Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe Ser 

420 " 425 430 

He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Gys Gin He 

435 440 44 5 

Tyr Gly Ala Gys Tyr Ser lie Glu Pre Leu Asp Leu Pro Gin lie He 

4 5 0 4 55 4 6 0 

Glu Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro 
465 470 475 480 

Gly Glu He Asn Arg Val Ala Ser Gys Leu Arg Lys Leu Gly Val Pro 

4 85 4 90 4 95 
Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Arg Leu 

5 0 0 5 0 5 51 0 

Leu Ser (Sin Gly Gly Arc Ala Ala Thr Gys Gly Lys Tyr Leu Phe Asn 

515 " 520 525 

Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro He Pro Ala Ala Ser 

53 0 53 5 54 0 

Gin Leu Asp Leu Ser Gly Trp Phe Val Ala Gly Tyr Ser Gly Gly Asp 
545 550 555 560 

He Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Gly Ser His His His 

5 05 " 57 0 " 57 5 

His His His 



<210> 5 
<211> 30 
<212:> DNA 

<213> Artificial Sequence 
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seq list 



<2 2 0> 

<2 2 1> prime r__bind 
<222> (1) ... (30) 

<223> Artificially synthesized primer sequence, SBNdelFW 

<225> E>NA encoding fusion protein consisting cf a 

portion of HCV polymerase and histidine tag at the 
C-terminus 



<4 00> 5 

catatgtcaa tgtcctacac at ggacagcc 

< 2 1 0 > 6 
<2il> 57 
<212> E'NA 

<2 1.3> Artificial Sequence 



30 



<220> 

<22 1 > primer_bind 
<222> (1) ... (57) 

<223> Artificially synthesized primer sequence, 5B570HRV 

<223> DNA encoding fusion protein consisting of a 

portion of HCV polymerase and histidine tag at the 
C-termi nus 

< 4 0 0 > 6 

ttattagtga tggtgatggt gatgggatcc gcggggtcgg gcacgagaca ggctgtg 57 

<210> 7 
<211> 57 
<212> E'NA 

<213> Artificial Sequence 
<220> 

<2 2 1> prime r_bind 
<222> (1) . . .757) 

<223> Artificially synthesized primer sequence, 5B552HRV 

<223> E'NA encoding fusion protein consisting of a 

portion of HCV polymerase and histidine tag at the 
C-terminus 

<400> 7 

ttattagtga tggtgatggt gatgggatcc aacgaaccag ccggacaagt ccagctg 57 
<210> c 
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seq list 

<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer _bind 
<222> (1) . . .'(57) 

<223> Artificially synthesized primer sequence, 5B544HRV 

<22 3> DNA encoding fusion protein consisting of a 

portion of HCV polymerase and hiszidine tag at the 
C- terminus 



< 4 0 0 > 8 

ttattagtga tggtgatggt gatgggatcc ctgggacgca gccgggattg gagtgag 57 

<210> 9 
<211> 67 
<212> E>NA 

<213> Artificial Sequence 
<220> 

<22 i> primer_bind 
<222> (1) . . .767) 

<223> Artificially synthesized primer sequence, 
5B536HRV 

<223> Artificially synthesized primer sequence 5B531HRV 
<4 00> 9 

ttattagtga tggtgatggt gatgggatcc gagtttgagt ttggtcttca ctgcccagtt 60 
gaagagg 67 

<210> 10 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> prime r_bind 
<222> (1) ... (60) 

<223> Artificially synthesized primer sequence, 
5B531HRV 

<223> Artificially synthesized primer sequence 5B591HRV 
<400> 10 

ttattagtga tggtgatggt gatgggatcc cttcactgcc cagttgaaga ggtacttgcc 60 
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seq list 



<210> 11 

<211> 52 

<212> DNA 

<213> Artificial 



Sequence 



< 2 2 0 > 

<2 2 1> primer_bind 
<222> ; 1.) . . .752) 

<223> Artificially synthesized primer sequence, 
5B591HRV 

<223> Artificially synthesized primer sequence 

< 4 0 0 > 1 1 

ttattaacgg tgatggtgat ggtgtccgga tcgattgggg agcaggtaga tg 



<210> 12 
<211> & 
<212> F'RT 
<213> Hepatitis 

< 2 2 1 J > 

<221> VARIANT 

<222> 1, 3 

<2 2 3> Xaa = Any 

<221> VARIANT 

< 2 2 2 > 1, 3 

<22 3> Xaa = Any 

<221> VARIANT 

<222> 1, 3 

<2 2 3> Xaa = Any 

<221> VARIANT 

<222> I , 3 

<22 3> Xaa - Any 

<400> 12 

Xaa Asp Leu Ser 



C Virus 

Amino Acid 

Amino Acid 

Amino Acid 

Amino Acid 

Gly Trp Phe Xaa 



<2K)> 13 
<211> 8 
<212> PRT 
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seq list 

<213> Hepatitis C Virus 
<400> 13 

Lys Asp Leu Ser Gly Trp Phe Lys 
1 5 
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